Big Graph Mining: Frameworks and Techniques
Authors
Abstract
Big graph mining is an important research area that has attracted considerable attention. It makes it possible to process, analyze, and extract meaningful information from large amounts of graph data. Interest in big graph mining has been driven not only by the rapidly increasing size of graphs but also by its many applications, which include bioinformatics, chemoinformatics, and social networks. One of the most challenging tasks in big graph mining is pattern mining in big graphs. This task consists of using data mining algorithms to discover interesting, unexpected, and useful patterns in large amounts of graph data. It also aims to provide a deeper understanding of graph data. In this context, several graph processing frameworks and scalable data mining/pattern mining techniques have been proposed to deal with very big graphs. This paper gives an overview of existing data mining and graph processing frameworks that deal with very big graphs. It then presents a survey of current research in the field of data mining/pattern mining in big graphs and discusses the main research issues related to this field. It also gives a categorization of distributed data mining and machine learning techniques, graph processing frameworks, and large-scale pattern mining approaches.
Similar resources
Distributed Data Processing Frameworks for Big Graph Data
Recently we create so much data (2.5 quintillion bytes every day) that 90% of the data in the world today has been created in the last two years alone [1]. This data comes from sensors used to gather traffic or climate information, posts to social media sites, photos, videos, emails, purchase transaction records, call logs of cellular networks, etc. This data is big data. In this report, we fir...
Graph Pattern Mining for Business Decision Support
To which extent can graph pattern mining enrich business intelligence? This question was the seed whose sprout became my PhD research. To find an answer, I investigated graph-based data integration, the calculation of business measures from graphs and suitable data mining techniques based thereon. The latter should identify correlations between occurrences of specific graph patterns and values ...
An Efficient Multi-Dimensional Data Analysis over Parallel Computing Framework
In the era of big data, data doubles in size year over year, so it is very difficult to handle and process the massive amount of data. Data storage and data handling should be done in real time and without loss of data. Cloud computing resolves the problem of storage and availability for data analysis tasks. Big data and parallel computing frameworks come into the picture wh...
Data Science for Social Good - 2014 KDD Highlights
As the premier international forum for data science, data mining, knowledge discovery and big data, the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) brings together researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. Partnered with Bloomberg, it celebrated its 20 years in 2014 with the theme “Data Sc...
Big Data Clustering: A Review
Clustering is an essential data mining tool for analyzing big data. Applying clustering techniques to big data is difficult due to the new challenges that big data raises. As big data refers to terabytes and petabytes of data, and clustering algorithms come with high computational costs, the question is how to cope with this problem and how to deploy clustering t...
Journal title:
- Big Data Research
Volume 6, Issue
Pages -
Publication date 2016